Improving Chinese Sentence Polarity Classification via Opinion Paraphrasing
Authors
Abstract
Although sentiment polarity classification has been studied extensively, the lack of sufficient opinion-annotated corpora for reliable training remains a challenge. In this paper we propose to improve a support vector machine (SVM) based polarity classifier by enriching both the training data and the test data via opinion paraphrasing. In particular, we first extract an equivalent set of attribute-evaluation pairs from the training data and then exploit it to generate opinion paraphrases, in order to expand the training corpus or enrich opinionated sentences for polarity classification. We tested our system on two sets of online product reviews, in the car and mobile phone domains. The experimental results show that using opinion paraphrases yields a significant performance improvement in polarity classification.
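The paraphrasing step described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes that attribute-evaluation pairs count as equivalent when they share an attribute and a polarity label, and that a paraphrase is produced by plain string substitution of the evaluation term. The function names (`build_equivalence_sets`, `paraphrase`, `expand_corpus`) and the English toy data are hypothetical.

```python
from collections import defaultdict


def build_equivalence_sets(pairs):
    """Group evaluation terms by (attribute, polarity).

    Assumption: two attribute-evaluation pairs are treated as equivalent
    when they share the same attribute and the same polarity label.
    """
    groups = defaultdict(set)
    for attribute, evaluation, polarity in pairs:
        groups[(attribute, polarity)].add(evaluation)
    return dict(groups)


def paraphrase(sentence, attribute, evaluation, polarity, groups):
    """Return variants of `sentence` with the evaluation term swapped
    for each other term in the same equivalence set."""
    if evaluation not in sentence:
        return []
    alternatives = sorted(groups.get((attribute, polarity), set()) - {evaluation})
    return [sentence.replace(evaluation, alt) for alt in alternatives]


def expand_corpus(corpus, groups):
    """Append paraphrases to the labelled corpus, keeping the original label."""
    expanded = []
    for sentence, attribute, evaluation, polarity in corpus:
        expanded.append((sentence, polarity))
        for variant in paraphrase(sentence, attribute, evaluation, polarity, groups):
            expanded.append((variant, polarity))
    return expanded


# Hypothetical toy data (English stand-ins for annotated review sentences).
pairs = [
    ("screen", "clear", "pos"),
    ("screen", "sharp", "pos"),
    ("screen", "blurry", "neg"),
]
groups = build_equivalence_sets(pairs)
corpus = [("the screen is clear", "screen", "clear", "pos")]
expanded = expand_corpus(corpus, groups)
```

The expanded list could then be passed to any SVM trainer. Under the same assumptions, the `paraphrase` function could also be applied to a test sentence to enrich it before classification.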
Similar resources
Polarity Classification of Short Product Reviews via Multiple Cluster-based SVM Classifiers
While substantial studies have been achieved on sentiment analysis to date, it is still challenging to explore enough contextual information or specific cues for polarity classification of short text like online product reviews. In this work we explore review clustering and opinion paraphrasing to build multiple cluster-based classifiers for polarity classification of Chinese product reviews un...
Chinese Sentence-Level Sentiment Classification Based on Fuzzy Sets
This paper presents a fuzzy set theory based approach to Chinese sentence-level sentiment classification. Compared with traditional topic-based text classification techniques, the fuzzy set theory provides a straightforward way to model the intrinsic fuzziness between sentiment polarity classes. To approach fuzzy sentiment classification, we first propose a fine-to-coarse strategy to estimate s...
Supervised Approaches and Ensemble Techniques for Chinese Opinion Analysis at NTCIR-7
For the opinion analysis task on traditional Chinese texts at NTCIR-7, supervised approaches and ensemble techniques have been used and compared in our participating system. Two kinds of supervised approaches were employed here: 1) the supervised lexicon-based approach, and 2) machine learning approaches. Ensemble techniques were also used to combine the results given by different approaches. B...
Using Morphological and Syntactic Structures for Chinese Opinion Analysis
This paper employs morphological structures and relations between sentence segments for opinion analysis on words and sentences. Chinese words are classified into eight morphological types by two proposed classifiers, CRF classifier and SVM classifier. Experiments show that the injection of morphological information improves the performance of the word polarity detection. To utilize syntactic s...
Supervised Approaches and Dependency Parsing for Chinese Opinion Analysis at NTCIR-8
In this paper, we describe our participating system, which is based on supervised approaches and dependency parsing, for opinion analysis on traditional Chinese texts at NTCIR-8. For opinionated sentence recognition, the supervised lexicon-based approach, SVM and Maximum Entropy are combined together. For polarity classification, we use only the supervised lexicon-based approach. For opinion ho...
Publication date: 2014